NSF PAR Search | NSF Public Access Repository


Parallel Dynamic Spatial Indexes

https://doi.org/10.1145/3774934.3786412

Men, Ziyang; Huang, Bo; Gu, Yan; Sun, Yihan (January 2026, ACM)

Full Text Available
Parallel Point-to-Point Shortest Paths and Batch Queries

https://doi.org/10.1145/3694906.3743311

Dong, Xiaojun; Li, Andy; Gu, Yan; Sun, Yihan (July 2025, ACM)

Full Text Available
Parallel k-Core Decomposition: Theory and Practice

https://doi.org/10.1145/3725332

Liu, Youzhe; Dong, Xiaojun; Gu, Yan; Sun, Yihan (June 2025, Proceedings of the ACM on Management of Data)

This paper proposes efficient solutions for k-core decomposition with high parallelism. The problem of k-core decomposition is fundamental in graph analysis and has applications across various domains. However, existing algorithms face significant challenges in achieving work-efficiency in theory and/or high parallelism in practice, and suffer from various performance bottlenecks. We present a simple, work-efficient parallel framework for k-core decomposition that is easy to implement and adaptable to various strategies for improving work-efficiency. We introduce two techniques to enhance parallelism: a sampling scheme to reduce contention on high-degree vertices, and vertical granularity control (VGC) to mitigate scheduling overhead for low-degree vertices. Furthermore, we design a hierarchical bucket structure to optimize performance for graphs with high coreness values. We evaluate our algorithm on a diverse set of real-world and synthetic graphs. Compared to state-of-the-art parallel algorithms, including ParK, PKC, and Julienne, our approach demonstrates superior performance on 23 out of 25 graphs when tested on a 96-core machine. Our algorithm shows speedups of up to 315× over ParK, 33.4× over PKC, and 52.5× over Julienne.
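For readers unfamiliar with the problem itself, the following is a minimal sequential sketch of k-core decomposition by repeated peeling of a minimum-degree vertex. It is not the parallel framework from the paper, and it uses none of the sampling, VGC, or hierarchical-bucket techniques the abstract describes; the function name `core_numbers` and the adjacency-dictionary input format are assumptions made for illustration. It assumes a simple undirected graph (symmetric adjacency lists, no self-loops or duplicate edges) whose vertex labels are mutually comparable.

```python
import heapq


def core_numbers(adj):
    """Return {vertex: coreness} for a simple undirected graph.

    `adj` maps each vertex to a list of its neighbors (hypothetical input
    format). This is a sequential peeling baseline, not the paper's algorithm.
    """
    # Current degree of every vertex that has not been peeled yet.
    degree = {v: len(neighbors) for v, neighbors in adj.items()}
    # Min-heap of (degree, vertex); outdated entries are skipped lazily.
    heap = [(d, v) for v, d in degree.items()]
    heapq.heapify(heap)

    coreness = {}
    k = 0  # largest minimum degree seen so far during peeling
    while heap:
        d, v = heapq.heappop(heap)
        if v in coreness or d != degree[v]:
            continue  # stale heap entry
        k = max(k, d)
        coreness[v] = k
        # Peeling v lowers the degree of each neighbor still in the graph.
        for u in adj[v]:
            if u not in coreness:
                degree[u] -= 1
                heapq.heappush(heap, (degree[u], u))
    return coreness


# A triangle (0, 1, 2) with a pendant vertex 3 attached to vertex 2.
example = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2]}
result = core_numbers(example)
```

In this example the pendant vertex has coreness 1 and the three triangle vertices have coreness 2. The heap makes the sketch run in O(m log n) time rather than the O(n + m) achievable with bucketing; it was chosen here only for brevity.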
Full Text Available
Parallel Contraction Hierarchies Can Be Efficient and Scalable

https://doi.org/10.1145/3721145.3725744

Wan, Zijin; Dong, Xiaojun; Wang, Letong; Zhu, Enzuo; Gu, Yan; Sun, Yihan (June 2025, ACM)

Full Text Available
Parallel kd-tree with Batch Updates

https://doi.org/10.1145/3709712

Men, Ziyang; Shen, Zheqi; Gu, Yan; Sun, Yihan (February 2025, Proceedings of the ACM on Management of Data)

The kd-tree is one of the most widely used data structures to manage multi-dimensional data. Due to the ever-growing data volume, it is imperative to consider parallelism in kd-trees. However, we observed challenges in existing parallel kd-tree implementations, for both constructions and updates. The goal of this paper is to develop efficient in-memory kd-trees by supporting high parallelism and cache-efficiency. We propose the Pkd-tree (Parallel kd-tree), a parallel kd-tree that is efficient both in theory and in practice. The Pkd-tree supports parallel tree construction, batch update (insertion and deletion), and various queries including k-nearest neighbor search, range query, and range count. We prove that our algorithms have strong theoretical bounds in work (sequential time complexity), span (parallelism), and cache complexity. Our key techniques include 1) an efficient construction algorithm that optimizes work, span, and cache complexity simultaneously, and 2) reconstruction-based update algorithms that guarantee that the tree remains weight-balanced. With the new algorithmic insights and careful engineering effort, we achieved a highly optimized implementation of the Pkd-tree. We tested the Pkd-tree on various synthetic and real-world datasets, including both uniform and highly skewed data. We compare the Pkd-tree with state-of-the-art parallel kd-tree implementations. In all tests, the Pkd-tree is consistently much faster than all baselines in construction and updates, with better or competitive query performance. We released our code.
Full Text Available
New Algorithms for Incremental Minimum Spanning Trees and Temporal Graph Applications

https://doi.org/10.1137/1.9781611978759.22

Ding, Xiangyun; Gu, Yan; Sun, Yihan (January 2025, Society for Industrial and Applied Mathematics)

Full Text Available
Parallel Cluster-BFS and Applications to Shortest Paths

https://doi.org/10.1137/1.9781611978339.4

Wang, Letong; Blelloch, Guy; Gu, Yan; Sun, Yihan (January 2025, Society for Industrial and Applied Mathematics)

Full Text Available
Parallel Joinable B-Trees in the Fork-Join I/O Model

https://doi.org/10.4230/lipics.isaac.2025.37

Goodrich, Michael T; Gu, Yan; Kitagawa, Ryuto; Sun, Yihan (January 2025, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Chen, Ho-Lin; Hon, Wing-Kai; Tsai, Meng-Tsung (Ed.)
Balanced search trees are widely used in computer science to efficiently maintain dynamic ordered data. To support efficient set operations (e.g., union, intersection, difference) using trees, the join-based framework is widely studied. This framework has received particular attention in the parallel setting, and has been shown to be effective in enabling simple and theoretically efficient set operations on trees. Despite the widespread adoption of parallel join-based trees, a major drawback of previous work on such data structures is the inefficiency of their input/output (I/O) access patterns. Some recent work (e.g., C-trees and PaC-trees) focused on more I/O-friendly implementations of these algorithms. Surprisingly, however, there have been no results on bounding the I/O-costs for these algorithms. It remains open whether these algorithms can provide tight, provable guarantees in I/O-costs on trees. This paper studies efficient parallel algorithms for set operations based on search tree algorithms using a join-based framework, with a special focus on achieving I/O efficiency in these algorithms. To better capture the I/O-efficiency in these algorithms in parallel, we introduce a new computational model, the Fork-Join I/O Model, to measure the I/O costs in fork-join parallelism. This model measures the total block transfers (I/O work) and their critical path (I/O span). Under this model, we propose our new solution based on B-trees. Our parallel algorithm computes the union, intersection, and difference of two B-trees with O(m log_B(n/m)) I/O work and O(log_B m ⋅ log₂ log_B n + log_B n) I/O span, where n and m ≤ n are the sizes of the two trees, and B is the block size.
Full Text Available
Parallel and (Nearly) Work-Efficient Dynamic Programming

https://doi.org/10.1145/3626183.3659958

Ding, Xiangyun; Gu, Yan; Sun, Yihan (June 2024, ACM)

Full Text Available
Brief Announcement: PASGAL: Parallel And Scalable Graph Algorithm Library

https://doi.org/10.1145/3626183.3660258

Dong, Xiaojun; Gu, Yan; Sun, Yihan; Wang, Letong (June 2024, ACM)

Full Text Available

